A model based voice activity detector for noisy environments

Authors

  • Kaavya Sriskandaraja
  • Vidhyasaharan Sethu
  • Phu Ngoc Le
  • Eliathamby Ambikairajah
Abstract

This paper presents a model-based voice activity detector (VAD) designed to operate in low signal-to-noise ratio (SNR) conditions and non-stationary noise environments. The proposed system uses Gaussian mixture models trained on Mel-frequency cepstral coefficients extracted from noisy speech data. In addition, smoothed frame-based log energy is used to augment the system so that it detects voice activity more accurately. Finally, the system's preliminary decisions are post-processed to remove some false acceptances, which further improves performance. Experimental results show that the proposed VAD significantly outperforms the system that currently produces state-of-the-art results on the QUT-NOISE-TIMIT database, with relative improvements of 34.58%, 17.18% and 3.5% for high, medium and low SNR scenarios, respectively.
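
The pipeline outlined in the abstract (per-frame scoring with Gaussian mixture models, support from smoothed log energy, then post-processing of preliminary decisions) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the diagonal covariances, the log-likelihood-ratio test, the AND rule combining the two cues, the moving-average smoother, the short-run removal, and every threshold and function name are assumptions introduced here.

```python
import math


def diag_gmm_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    comp = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        comp.append(ll)
    m = max(comp)  # log-sum-exp for numerical stability
    return m + math.log(sum(math.exp(c - m) for c in comp))


def smooth(values, width):
    """Centered moving average; the window shrinks at the edges."""
    half = width // 2
    out = []
    for i in range(len(values)):
        window = values[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def remove_short_runs(decisions, min_len):
    """Relabel speech runs shorter than min_len frames as non-speech."""
    out = list(decisions)
    i = 0
    while i < len(out):
        if not out[i]:
            i += 1
            continue
        j = i
        while j < len(out) and out[j]:
            j += 1
        if j - i < min_len:
            out[i:j] = [False] * (j - i)
        i = j
    return out


def vad(features, log_energy, speech_gmm, noise_gmm,
        llr_threshold=0.0, energy_threshold=0.0,
        smooth_width=5, min_speech_frames=3):
    """Frame-level speech/non-speech decisions.

    Assumption: a frame is speech only if the GMM log-likelihood ratio AND
    the smoothed log energy both exceed their (hypothetical) thresholds.
    Each GMM is a (weights, means, variances) tuple.
    """
    smoothed = smooth(log_energy, smooth_width)
    prelim = []
    for x, e in zip(features, smoothed):
        llr = diag_gmm_loglik(x, *speech_gmm) - diag_gmm_loglik(x, *noise_gmm)
        prelim.append(llr > llr_threshold and e > energy_threshold)
    return remove_short_runs(prelim, min_speech_frames)
```

In practice the features would be MFCC vectors and the two models would be trained on labelled noisy speech; here the model parameters are simply passed in, so the sketch covers only the decision stage.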

Similar resources

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signal may be classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

Voice Activity Detection in Noisy Environments Based on Double-Combined Fourier Transform and Line Fitting

A new voice activity detector for noisy environments is proposed. In conventional algorithms, the endpoint of speech is found by applying an edge detection filter that finds the abrupt changing point in a feature domain. However, since the frame energy feature is unstable in noisy environments, it is difficult to accurately find the endpoint of speech. Therefore, a novel feature extraction algo...

Voice Activity Detector and Noise Trackers for Speech Recognition System in Noisy Environment

It is well known that the performance of a speech recognition system degrades drastically in adverse environments. Additive noise is one of the major elements of an adverse noisy environment. Detecting voiced, unvoiced or silent speech segments in a noisy environment is not an easy task. A voice activity detector (VAD) is effective when the noise is stationary; it often fails when the noise st...

Noise robust voice activity detection based on switching kalman filter

This paper addresses the problem of voice activity detection (VAD) in noisy environments. The VAD method proposed in this paper is based on a statistical model approach, and estimates statistical models sequentially without a priori knowledge of noise. Namely, the proposed method constructs a clean speech / silence state transition model beforehand, and sequentially adapts the model to the nois...

Noise Estimation for Speech Enhancement in Non-Stationary Environments-A New Method

This paper presents a new method for estimating the nonstationary noise power spectral density given a noisy signal. The method is based on averaging the noisy speech power spectrum using time and frequency dependent smoothing factors. These factors are adjusted based on signal-presence probability in individual frequency bins. Signal presence is determined by computing the ratio of the noisy s...

Publication date: 2015